Title of dissertation : Collinearity Diagnostics for Complex Survey Data

نویسندگان

  • Dan Liao
  • Richard Valliant
چکیده

Title of dissertation: Collinearity Diagnostics for Complex Survey Data Dan Liao Doctor of Philosophy, 2010 Dissertation directed by: Professor Richard Valliant Joint Program in Survey Methodology Survey data are often used to fit models. The values of covariates used in modeling are not controlled as they might be in an experiment. Thus, collinearity among the covariates is an inevitable problem in the analysis of survey data. Although many books and articles have described the collinearity problem and proposed strategies to understand, assess and handle its presence, the survey literature has not provided appropriate diagnostic tools to evaluate its impact on the regression estimation when the survey complexities are considered. The goal of this research is to extend and adapt the conventional ordinary least squares collinearity diagnostics to complex survey data when a linear model or generalized linear model is used. In this dissertation we have developed methods that generally have either a model-based or design-based interpretation. We assume that an analyst uses surveyweighted regression estimators to estimate both underlying model parameters (assuming a correctly specified model) and census-fit parameters in the finite population. Diagnostics statistics, variance inflation factors (VIFs), condition indexes and variance decomposition proportions are constructed to evaluate the impact of collinearity and determine which variables are involved. Survey weights are components of the diagnostic statistics and the estimated variances of the coefficients are obtained from design-consistent estimators which account for complex design features, e.g. clustering and stratification. Illustrations of these methods are given using data from a survey of mental health organizations and a household survey of health and nutrition. We demonstrate that specialized collinearity diagnostic statistics are needed to account for survey weights and complex finite population features that are reflected in the sample design and considered in the regression analysis. Collinearity Diagnostics for Complex Survey Data

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Condition indexes and variance decompositions for diagnosing collinearity in linear model analysis of survey data

Collinearities among explanatory variables in linear regression models affect estimates from survey data just as t hey do in non-survey data. Unde sirable effects are unnecessarily inflated standard err ors, spuriously low or high t-statistics, and parameter estimates with illogical signs. The available collinearity diagnostics are not generally appropriate for survey data because the variance ...

متن کامل

Where’s Waldo? Visualizing Collinearity Diagnostics

Collinearity diagnostics are widely used, but the typical tabular output used in almost all software makes it hard to tell what to look for and how to understand the results. We describe a simple improvement to the standard tabular display, a graphic rendition of the salient information as a ‘‘tableplot,’’ and graphic displays designed to make the information in these diagnostic methods more re...

متن کامل

Variance inflation factors in the analysis of complex survey data

Survey data are often used to fit linear regression models. The values of covariates used in modeling are not controlled as they might be in an experiment. Thus, collinearity among the covariates is an inevitable problem in the analysis of survey data. Although many books and articles have described the collinearity problem and proposed strategies to understand, assess and handle its presence, ...

متن کامل

Collinearity and Least Squares Regression

abstract In this paper we introduce certain numbers, called collinearity indices, which are useful in detecting near collinearities in regression problems. The coeecients enter adversely into formulas concerning signiicance testing and the eeects of errors in the regression variables. Thus they provide simple regression diagnostics, suitable for incorporation in regression packages.

متن کامل

Linear Regression Diagnostics in Cluster Samples

An extensive set of diagnostics for linear regression models has been developed to handle nonsurvey data. The models and the sampling plans used for finite populations often entail stratification, clustering, and survey weights, which renders many of the standard diagnostics inappropriate. In this article we adapt some influence diagnostics that have been formulated for ordinary or weighted lea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010